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Abstract 

We study the distributed detection problem in the context of a balanced binary relay tree, where the 
leaves of the tree correspond to N identical and independent sensors generating binary messages. The 
root of the tree is a fusion center making an overall decision. Every other node is a relay node that 
aggregates the messages received from its child nodes into a new message and sends it up toward the 
fusion center. We derive upper and lower bounds for the total error probability Pjy as explicit functions 
of N in the case where nodes and links fail with certain probabilities. These characterize the asymptotic 
decay rate of the total error probability as TV goes to infinity. Naturally, this decay rate is not larger 
than that in the non-failure case, which is y/~N. However, we derive an explicit necessary and sufficient 
condition on the decay rate of the local failure probabilities pk (combination of node and link failure 
probabilities at each level) such that the decay rate of the total error probability in the failure case is 
the same as that of the non-failure case. More precisely, we show that logP^ 1 = ®(y/~N) if and only 
if logp- 1 = f!(2 fe/2 ). 
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I. Introduction 

Consider a distributed detection system consisting of sensors, relay nodes, and a fusion center, net- 
worked as a directed tree with sensors as leaves and fusion center as the root. The objective of the 
system is to jointly make a decision between two given hypotheses. Each sensor sends a message to its 
parent node, which could be a relay node or the fusion center. Each relay node fuses all the messages 
received into a new message, and then sends the new message to its parent node, which again could 
be a relay node or the fusion center. Ultimately, the fusion center makes an overall decision about the 
true hypothesis. The goal is to analyze the detection performance by investigating the scaling law of the 
total error probability at the fusion center, as the number N of sensors goes to infinity. This problem 
has been studied in the context of various architectures. What distinguishes different architectures is the 
configuration of relay nodes between the sensors and the fusion center, which affects the performance. 

In the extensively studied parallel architecture 101- 11221 . every sensor sends a message directly to 
the fusion center without any relay nodes. In this case the detection performance is the best among all 
architectures, and the error probability at the fusion center decays to exponentially fast with respect to the 
number N of sensors. However, this configuration is not energy-efficient in a large-scale sensor network 
because sensors located far away from the fusion center have to spend more power to communicate reliably 
to the fusion center. The energy consumption can be largely reduced by setting up a tree architecture 
with intermediate relay nodes, although the detection performance in a tree architecture cannot be better 
than that in a parallel architecture. 

The bounded-height tree architecture has been considered in |[22l - |[32l . Under the Neyman-Pearson 
criterion, the error probability at the fusion center decays exponentially fast to with the same exponent as 
that of a parallel architecture ll24ll . Under the Bayesian criterion, the total error probability Pjy at the fusion 
center still decays exponentially fast with respect to the number N of sensors, i.e., logP^ 1 = Q(N), 
but with an exponent that is smaller than that of the parallel configuration [27][j]ln either case, the 

1 We use the following notation in characterizing the scaling law of the asymptotic decay rate. Let / and g be positive functions 
defined on positive integers. We write f(N) — 0(g(N)) if there exists a positive constant c\ such that f(N) < cig(N) for 
sufficiently large N. We write f(N) = Sl(g(N)) if there exists a positive constant C2 such that f(N) > c%g{N) for sufficiently 
large N. We write f(N) = Q(g(N)) if f(N) = 0(g(N)) and /(TV) = n{g(N)). For N -> oo, f(N) = o(g(N)) means that 
f(N)/g(N) -> and f(N) = u(g(N)) means that f(N)/g(N) -> oo. 



March 6. 2013 



DRAFT 



3 



bounded-height tree architecture does not significantly compromise the detection performance relative to 
the parallel configuration in terms of the exponential decay of the total error probability. Note that log 
stands for binary logarithm throughout this paper. 

Tree architectures with unbounded height have been considered in |[33l - |[35l . In particular, Gubner et 
al. 11331 consider a balanced binary relay tree of the form shown in Fig. [T] In this configuration, the leaf 
nodes depicted as circles are identical sensors, which send binary messages to their parent nodes at the 
next level. Each node depicted as diamond is a relay node which fuses the two binary messages received 
from its child nodes into a new binary message and sends it upward. Ultimately, the fusion center at the 
root makes an overall decision. If the number of arcs in the path from a node to the nearest sensor is k, 
then this node is said to be at level k. 

The balanced binary relay tree architecture is of interest because it is the worst-case scenario in the 
sense that the minimum distance from the sensors to the fusion center is the largest (the fusion center 
is maximally far away from the sensors). Tree networks with unbounded heights arise in a number 
of practical situations. Consider a wireless sensor network consisting of a large number of spatially 
distributed sensors. Due to limited sensing ability, we wish to aggregate the distributed information 
into a fusion center to jointly solve a hypothesis testing problem. Typically, each sensor has also a 
limited power for processing and transmitting information. As mentioned before, the energy consumption 
for transmitting information can be significantly reduced by setting up a tree architecture. Moreover, 
the assumption of unbounded heights and moderate degrees is natural for interference-limited wireless 
networks. In particular, systems in which a nonleaf node communicates with a significant fraction of 
nodes are likely to scale poorly because of interference. Another application is in social learning in 
multi-agent social networks, where each node represents an agent. Each agent interacts and exchanges 
information with its neighboring agents, and makes a decision about the underlying state of the world. 
Hierarchical tree architectures are common in enterprises, military hierarchies, political structures, and 
even online social networks. Also, it is well known that many real-life social networks are scale-free: 
The degree of each node is bounded with high probability. Therefore, it is of interest to consider the 
distributed decision making problem in tree architectures with bounded degree and hence unbound height. 

We assume that the sensors are conditionally independent in this configuration, and that all nonleaf 
nodes use the same fusion rule: the unit-threshold likelihood-ratio test [37]. Under these assumptions, 
Gubner et al. [33 ] show the convergence of the total error probability to using Lyapunov methods. Under 
the same assumptions, in 041 we derive tight upper and lower bounds for the total error probability at the 
fusion center as functions of N. These bounds reveal that the convergence of the total error probability 
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at the fusion center is sub-exponential with exponent y/N, i.e., logP N = Q(y/N) 



Fusion 
Cen ter 





N=2 h 



Fig. 1. A balanced binary relay tree with height h. Circles represent sensors which send binary messages. Diamonds represent 
relay nodes which fuse binary messages. The rectangle at the root represents the fusion center making an overall decision. 



The assumption in [331- 11351 is that all messages are transmitted reliably in perfect channels. However, 
in practical scenarios, the nodes are failure-prone and the communication channels are not perfect, wherein 
messages are subject to random erasures. The literature on distributed detection problem in tree networks 
with node and link failures is quite limited. Tay et al. |32l provide an asymptotic analysis of the impact 
of imperfect nodes and links modeled as binary symmetric channels in trees with bounded height using 
branching process and Chernoff bounds. However, the detection performance for unbounded-height trees 
with failure-prone nodes and links is still open. 

In this paper, we investigate the distributed detection problem in the context of balanced binary relay 
trees where nodes and links fail with certain probabilities. This is the first paper on performance analysis 
of unbounded-height trees with imperfect nodes and links. We derive non-asymptotic bounds for the total 
error probability Pjy as functions of N. These bounds in turn characterize the asymptotic decay rate of 
the total error probability. Naturally, one would expect that the detection performance in the failure case 
cannot be better than that in the non- failure case studied in |[34l . But are there conditions on the failure 
probabilities under which the total error probability for a tree with failures decays as fast as that for the 
tree with no failures? We answer this question affirmatively and derive an explicit necessary and sufficient 
condition on the decay rate of the local failure probabilities (combination of node and link failure 
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probabilities at level k) for this to happen. More specifically, the decay rate of the total error probability 
is still sub-exponential with the exponent y/N in the asymptotic regime, i.e., logi-^ 1 = Q(y/N), if and 
only if the local failure probabilities pk satisfies logp^ 1 = Q,(2 k l 2 ). 

II. Problem Formulation 

We consider the problem of binary hypothesis testing between Hq and Hi in a balanced binary relay 
tree with failure-prone nodes and links, shown in Fig. [2] (the notation there will be defined below). Each 
sensor (circle) sends a binary message upward to its parent node. Each relay node (diamond) fuses two 
binary messages from its child nodes into a new binary message, which is then sent to the node at the 
next level. This process is repeated culminating at the fusion center, where an overall binary decision 
is made. We assume that all sensors are conditionally independent given each hypothesis, and that all 
sensor messages have identical Type I error probability ao (also known as probability of false alarm) 
and identical Type II error probability (3q (also known as probability of missed detection). Moreover, we 
assume that each node at level k fails with identical node failure probability (a failed node cannot 
transmit any message upward). We model each link as a binary erasure channel ll38l as shown in Fig. [3] 
With a certain probability, the input message X (either or 1) gets erased and the receiver does not get 
any data. We assume that the links between nodes at height k and height k + 1 have identical probability 
of erasure Ik- 




N=2 h 



Fig. 2. A balanced binary relay tree with node and link failures. 

Suppose that a node M or a link C in the balanced binary relay tree fails. Then equivalently, we can 
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remove the substructure below that node or link. Therefore, with a distribution given by the node and link 
failure probabilities, the system reduces to a random subtree of the balanced binary relay tree, which is 
unbalanced in general. Note that performance analysis of balanced binary relay trees with node and link 
failures essentially gives the expected performance for a family of random trees constructed by pruning 
a balanced binary relay tree. 

Consider a node A4 at level k connected to its parent node A4+i at level k + 1. We define several 
events as follows: 

• E^: the event that the node A4 does not have a message to transmit, i.e., Mt does not receive any 
messages from both its child nodes. We denote the probability of this event by Vk and we call it 
the starvation probability. 

• E^ : the event that either the node A4 fails or the link from A4 to Wfc+i fails. We call the occurrence 

(2) 

of E k a local failure and we denote by pk the local failure probability. 

• El': the event that A4+i does not receive a message from A4- We denote the probability of this 
event by qk and we call it the silence probability. 

Note that E^ occurs if and only if either (i) the node A/& does not have a message to transmit (event 
E^), or (ii) the node Mk does have a message to transmit but a local failure occurs (event E^ 1 ). The 
probability of case (i) is simply Vk- The probability of case (ii) is pk, which equals the conditional 
probability of E^ given E^ (the complement of the event E^\ which means that A/fc has a message 
to transmit). Thus, 

Pk = P(E^)=P(E^\E^) 
= n k +£k- n k £k- 

By the law of total probability, we have 

qk = p(4 3) ) = + p(4 3) i4 1) ) p (4 1) ) 

= r k + pk(i-v k ). 

Consider the parent node Mk+i- This node does not have a message to transmit (event E^^) if and 
only if it does not receive messages from both its two child nodes. The probability Vk+i of this event is 

V k+ i=qi = (Vk+Pkil-Vk)) 2 . 
Recursively, we can show that the probability of the event that the parent node of Nk+\ does not receive 



March 6, 2013 



DRAFT 



7 



messages from A4+i is 

Qk+l = Vk+i +Pfc+i(l - Vk+l) 

= ql+Pk+i(i-ql), (i) 

where Pk+i = + ^fc+i — n k -\-i&k+i denotes the local failure probability for level k + 1. 




Fig. 3. A binary erasure channel with input X and output Y. The input message is erased with probability i. 

Denote the Type I and Type II error probabilities for the nodes at level k by a k and f3 k , respectively. 
Consider node A4+i at level k + 1, which possibly receives messages from its two child nodes. We have 
three possible outcomes: 

i. Wfc+i does not receive any message from each of the two child nodes. 

ii. Wfc+i receives a message from only one of the two child nodes. 

iii. Nk+i receives messages from both the two child nodes. 

In case i, if Nk+i does not receive any message from each of the two child nodes, then A4+i does 
not have any information for fusion. Therefore, we cannot define the Type I and II error probabilities 
associated with A/fe+i in this situation. The probability of this event is q\. 

In case ii, if the parent node A4+i receives data from only one of the child nodes, then the Type I 
and Type II error probabilities do not change since the parent node receives only one binary message 
and directly sends this message without fusion. The probability of this event is 2^(1 — q^), in which 
case we have 

(a k+1 ,/3 k+1 ) = (a k ,f3 k ). (2) 
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In case iii, if the parent node receives messages from both child nodes, then the scenario is the same 
as that in |[33l and If34"1 . The probability of this event is (1 — qk) 2 , in which case we have 



(a k +i, Pk+i) 



(l-(l-a k ) 2 ,Pl), ifa k <(3 k , 
(a 2 k ,l -(I- Pk) 2 ), ifa k >p k . 



Consider the expected Type I and Type II error probabilities conditioned on the event that the parent 
node receives at least one message from its child nodes (cases ii and iii), that is, given that the parent 
node has data. If a k < f3 k , then by ([2]) and ([3]), given that the parent node at level k + 1 has data, the 
expected Type I error probability after fusion is given by 

__ (1 - g k ) 2 (2a k - q|) + 2q k (l - q k )a k _ (1 - q k ){2a k - a\) + 2q k a k 
ak+l (1 - qk) 2 + 2q k (l - q k ) l + qk 

The expected Type II error probability after fusion is given by 

i = (l-q k ) 2 P 2 k +2q k (l-q k )p k = (1 - q k )f3 2 + 2q k fi k 
+ (1 - q k ) 2 + 2q k (l - q k ) 1 + q k 

By symmetry, we can calculate the Type I and II error probabilities in the case where a k > (3 k . Note 

that the recursion for (a k ,(3 k ) depends on the sequence {q k }, which is given by ([T]). 

We can summarize the above discussion with the following recursion: 

(afc+i, Pk+i, Qk+i) = f(ak,Pk,<lk)> 

where 

„ a , \ ( ^^h^ , i^fl^ f q 2 + (1 _ ql)pk+ \ f tf ak < phf 
f{a k ,f3 k ,q k ) := < ) ( (4) 

Recall that all sensors have the same error probability triplet (ao,flo, qo), where qo = po = uq + £q — 
nolo. Therefore, by the above recursion Q, all relay nodes at level 1 will have the same error probability 
triplet (ai,/3i,gi) = / (ao, A)> 9o) (where a± and j3\ are the expected error probabilities). Similarly we 
can calculate error probability triplets for nodes at all other levels. We have 

(a k+1 ,p k+1 ,q k+l ) = f(a k ,P k ,q k ), A; = 0,1,..., (5) 

where (a k ,/3 k ,q k ) is the error probability triplet of nodes at the kth level of the tree. 

Consider (a k ,(3 k , q k ) as a discrete dynamic system governed by ^ with p k as its input. Notice that the 
dynamic system depends on the exogenous parameters n k and t k only through p k . An example trajectory 
of this dynamic system is shown in Fig. Qa), with the local failure probabilities given by p k +i = p\. We 
observe that q k decreases very quickly to in this case. In addition, as shown in Fig. |4jb), the trajectory 
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approaches (3 = a at the beginning. When (afe,/3fc) approaches sufficiently close to the line (3 = a, the 
next pair (ak+i, Pk+i) is flipped to the other side of the line ft = a. This behavior is similar to the 
non-failure scenario, in which case there exists an invariant region in the sense that once the system 
enters the invariant region it stays in there 1341 . Is there an invariant region in the failure case where 
Pk 7^ 0? We answer this question affirmatively by precisely describing this invariant region in R 3 . 

Our analysis builds on and further develops the method in ll34l . We view the local failure probability 
Pk as an exogenous input to the dynamic system ([5]). In this case, the evolution of the dynamic system 
also depends on the exogenous input. In Section III, we show that the dynamic system enters and stays 
in an invariant region in R 3 given that p^ is a non-increasing sequence. Then in Section IV, under 
certain conditions on the exogenous input, we derive upper and lower bounds for the ratio of the total 
error probabilities associated with two steps of the dynamic system, from which we derive upper and 
lower bounds for the total error probability at the fusion center as functions of N. These bounds in turn 
characterize the asymptotic decay rate of the total error probability. Last, we discuss the relationship 
between the decay of the exogenous input p^ and the decay rate of the total error probability. 

III. Evolution of Type I, Type II, and Silence Probabilities 

Notice that the recursion Q is symmetric about the hyperplanes a + (3 = 1 and (3 = a. Thus, it 
suffices to study the evolution of the dynamic system only in the region bounded by a + f3 < 1, j3 > a, 
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and < q < 1. Let 

U := {(a, p, q) > 0\a + P < 1, /3 > a, and q < 1} 
be this triangular prism. Similarly, define the complementary triangular prism 

C:={(a,p,q)>0\a + P <1, P < a, and q < 1}. 
First, we introduce the following region: 

B :={(<*, /3, g) G < -<?/(l - g) + \A 2 + (1 - <?) 2 (2a - a 2 ) + 2g(l - q)a/(l - q)}. 

It is easy to show that if (ak,Pk,qk) E -B, then the next triplet (afc+i, gfe+i) jumps across the 
plane P = a away from (a k ,Pk, Ik)- This process is shown in Fig. |4jb) from to 1. More precisely, if 
{a k ,p k ,q k ) G U, then (a k ,Pk,qk) £ B if and only if (a k +i, Pk+i, qk+i) & C. In other words, B is the 
inverse image of £ in W under mapping /. 

Note that if the initial error probability triplet is outside B, i.e., (ao> Po, qo) El4\B, then before the 
system enters B, we have a&+i > ak and /3fc+i < Thus, the dynamic system moves toward the 
P = a plane, which means that if the number N of sensors is sufficiently large, then the dynamic system 
is guaranteed to enter B. 

Next we consider the behavior of the system after it enters B. If (ak,Pk,q k ) E B, we consider the 
position of the next pair (af.+i, Pk+u 9fc+i)> i- e -> we consider the image of B under /, which we denote 
by Rc- Similarly we denote by Ru the reflection of R/r with respect to P = a. This region is shown in 
Fig. [5] in the (a,P,q) coordinates. We find that 

Ru := {(a, P, q) E U\P < -a + 2(^ 2 + (1 - <Z 2 )« - g)/(l - ?)}• 

The sets Ru and B have some interesting properties. We denote the projection of the upper boundary 
of Ru and B onto the (a, P) plane for a fixed q by R q u and i? 9 , respectively. It is easy to see that if 
q\ < <Z2, then R u x lies above i?^ 2 in the (a,P) plane. Similarly, if q± < q2, then B qi lies above B q2 in 
the (a, /?) plane. Moreover, we have the following proposition. 

Proposition 1: B C .Rw 

The proof is given in Appendix A. Note that B and Ru share the same lower boundary p = a. Thus, 
it suffices to prove that the upper boundary of B lies below that of Ru for a fixed q, i.e., B q lies above 
R u in the (a, P) plane. The reader can refer to Figs, [f^a) and|6^b) for plots of the upper boundaries of 
Ru and B projected onto the (a, P) plane for two fixed values of q. 
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0.0 0.2* 0.4 




Fig. 5. Ru in the (a, j3, q) coordinates. 



Let us denote by R the region RyLiRc- Then, so far we have shown that if the tree height is sufficiently 
large the system enters R. Next we show below that R is an invariant region in the sense that once the 
system enters R, it stays there. 

Proposition 2: Suppose that {au Q , /3fc , qk ) £ R for some ko and the sequence {q^} is non-increasing 
for k > ko. Then, (a^, /3&, G i? for all fc > fco- 

Proof: Without loss of generality, we assume that (q^, Pk,Qk) £ -Rw- We know that Rc is the image 
of U in £. Thus if the next state (ctk+i, fik+i, Qk+i) £ £> then it must be inside Rc- We already have 
Qk+i < <Zfc> which indicates that Rjj +1 lies above in the (a,/3) plane. Moreover, for a fixed g, the 
upper boundary R u is monotone increasing in the (a, /3) plane. We already know that a^+i > au and 
< Pk- As a result, if the next state (afc+i, Afe+i, Qk+l) £ ^> tnen the next state is in fact inside 
Ru. Note that in Fig. |4jb), the dynamic system stays in a neighbor region of /3 = a after it gets close 
to p = a. 

m 

To study the asymptotic detection performance, we can simply analyze the case where the system 
lies inside the invariant region and stays inside it. We assume that {%} is a non-increasing sequence. 
We will show in the next section that without this assumption, the decay rate is strictly more slowly 
than that of the non-failure case. Note that {q^} is a sequence depending on the input pjf., which in turn 
depends on the exogenous parameters and £f.. Next we provide a sufficient condition for {q^} to be 
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Fig. 6. (a) Upper boundaries for Ru and B for q — 0.1. (b) Upper boundaries for Ru and B for q — 0.01. 
non-increasing. 

Proposition 3: Suppose that pu+i < £>fc for all k and gi < qo. Then, is a non-increasing sequence. 
Proof: The recursive for is g^+i = q\ + (1 ~~ 9fc)Pfc+l- Since {pk} is non-increasing, we have 



Notice that this recursion is simply a weighted sum of 1 and pt+i- From the initial condition that q\ < qo, 
it is easy to see that qk+i < qu using mathematical induction. 

■ 

Henceforth, we assume that p^ is non-increasing and therefore q^ is monotone non-increasing as well. 
Based on the above propositions, in the next section we study the reduction of the total error probability 
when the system lies in R to determine the asymptotic decay rate. 



In this section, we first compare the step-wise reduction of the total error probability between the 
failure case and non-failure case. Then, we show that the decay of the failure case cannot be faster than 
that of the non-failure case. However, we provide a sufficient condition such that the scaling law of the 
decay rate in the failure case remains the same as that of the non-failure case and we discuss how this 
sufficient condition is satisfied in terms of the input parameter p&. 



qk+2 = ql+i + (i - ql+i)pk+2 < ql+i + (l - q 2 k+l )p k +i- 



IV. Error Probability Bounds and Asymptotic Decay Rates 
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A. Step-wise Reduction and Asymptotic Decay Rate 

We will first consider the case where the prior probabilities are equal, i.e., P(Hq) = P{H{) = 1/2. 
We define P k = a k + P k to be (twice) the total error probability for nodes at level k. 

1) Step-wise Reduction: In this part, we show that in the failure case, the decay of the total error 
probability for a single step cannot be faster than that of the non-failure case. 

Proposition 4: Let 4+1 = 4+1 + 4+1 ^ e (twice) the total error probability at the next level from 
the current state q). Suppose that (a k ,P k ,qi) and (a k ,p k ,q2) 6W. If q\ < q2, then 

^fc+l - h k+l 

with equality if and only if a k = Pk- 

Proof: It is easy to show the following inequality 

2a k - a\ + p\ < a k + /3 k 

^-P k ~ ot\ < P k - a k 

holds in the region a k + P k < 1 and p k > a k . The equality is satisfied if and only if p k = a k . 
From the recursion described in d4l, we have 



T { q ) 1 - q T (0) 2g 
L kii = YT~q k + l + l + ~q^ ak + Pk) ' 



where 4+i = 2a k - a\ + P\. Notice that 



l + <7 1 + 



Therefore, we can write 4+1 = 7Ti4+i + (1 — ir\)(a k + P k ), where ni = (1 — gi)/(l + gi). Let 
7T2 = (1 — <72)/(l + 92)- Then, it is easy to see that iri > 7^. Thus, we have 

4+1 = ^4+1 + (i - + a) + (^2 - ^i)4°+i - fa - ^i)4+i 

< 7ra4°2i + (1 - 7ri)(a fc + + (vr 2 - 7r x ) A - (tt 2 - + = 4+}- 



From Proposition 4 we immediately deduce that if q > 0, then 4+i — 4+1 ■ ^ ms means that the 
decay of the total error probability for a single step is fastest if the silence probability is (non-failure 
case). In other words, for the failure case, the step-wise shrinkage of the total error probability cannot 
be faster than that of the non-failure case, where the total error probability decays to with exponent 
\fN [34]. In addition, we show in this section that the asymptotic decay rate for the failure case cannot 
be faster than that of the non-failure case. 
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2) Asymptotic Decay Rate: With the assumption of equally likely hypotheses, we denote (twice) the 
total error probability for nodes at the fusion center by P/v := Aogiv- Using Proposition |4j we provide 
an upper bound for logP^ , which in turn provides an upper bound for the decay rate. 

Theorem 1: Suppose that (ao, A)><7o) £ R- Then, 

logP- x < ^(logL^ + l)- 

The proof is given in Appendix B. Theorem [l] provides an upper bound for logP^ 1 . From this upper 
bound, it is easy to get an upper bound for the asymptotic decay rate. 
Corollary 1: Suppose that (ao,A))9o) e R- Then, logP^ = 0(y/N). 

Compared with the decay rate for the non-failure case, the rate in Corollary [j] is not faster than \/~N 
(note that the scaling law for decay rate for the non-failure case is exactly y/~N). This observation is 
unsurprising because the case where nodes and links are perfect has the best detection performance. But 
is it possible that the decay rate for the failure case remains yNl In the next section, we show that this 
is possible if the silence probabilities decay to sufficiently fast. We also characterize how fast the local 
failure probabilities need to decay to such that the decay rate for the total error probability remains 
y/N. 

B. Error Probability Bounds and Decay Rates 

In this section, we first give a sufficient condition for the ratio Lk+2/ Li to be bounded. Then, we 
derive upper and lower bounds for the total error probability at the fusion center for trees with even 
and odd heights, in the equal prior scenario. Under the sufficient condition, we show that the decay rate 
of the total error probability remains the same as that of the non-failure case. We will also discuss the 
non-equal prior scenario. 

Proposition 5: Suppose that (a^, Pk,Qk) £ R and is monotone non-increasing. If there exists C > 
such that qk < CLk, then the ratio Lk+2/ L\ is bounded as 

1 L k 

The proof is provided in Appendix C. The constant C in Proposition [5] gives the scale relation between 
the silence probabilities q^ and the total error probabilities Lk. Note that the upper bound of Lk+2/L\ in 
Proposition^ depends linearly on C. Therefore, the tightness of the upper and lower bounds for Lk+2/L\ 
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Fig. 7. (a) The ratio Lk+2/Lj. in B for C = 1. (b) The ratio Lk+2/Ll in -Rw \ -B for C = 1. Each line depicts the ratio 
versus /3fc for a fixed a^. 



depends on the constant C. If C = 0, then qt = for all k and the problem reduces to the non-failure 
case, where the ratio is bounded above by 2 (see Il34l0 . This represents the case where the 

bounds are the tightest. Figs. |7ja) and (b) show the behavior of Lfc +2 /L| in the regions B and Rjj \ B 
for the case where C = 1, i.e., < L^. This example provides a visualization of the two-step reduction 
of the total error probability. 

Proposition [5] establishes bounds on the reduction in the total error probability for every two steps. 
From these, we can derive bounds for log-P^ 1 for even-height trees, i.e., logiV is even. 

Theorem 2: Suppose that (ao, Po, qo) £ R and qu is monotone non-increasing. If there exists C > 
such that qk < CL^ for k = 0, 1, . . . , log N — 1, then for the case where log N is even, 

(log L 1 - log(6C + 2)) < log < (log L 1 + l) . 

Proof: If (ao,A)j<?o) £ -R and g& is non-increasing, then we have {oik, hiQk) £ R for k = 
0, 1, . . . , log N — 2. From Proposition [5j we have L^ +2 = a kL\, for = 0, 1, ... , log N — 2 and some 
a k G [1/2, 6C + 2]. Therefore, for k = 2, 4, . . . , log TV, we have 

2 2 (fc ~ 2)/2 r 2 fc/2 

ivfc — d(fc_ 2 )/2 ' °(jfc-4)/2 " " a ^0 > 
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where ai G [1/2, 6C + 2], i = 0, 1, . . . , (k — 2)/2. Taking logs and using /c = log N, we have 

log P N X = - log a (fc _ 2 )/2 - 2 log a (fe _ 4)/2 2^ 2 ^ 2 log a + ViV log L \ 

Notice that logL 1 > and -1 < logOj < log(6C + 2) for all i. Thus, 

logP/ < ^/iVlogLo 1 + = VN(logL Q 1 + 1) . 

Finally, 

logPjv 1 > -log(6C + 2)ViV + ViVlogLo 1 = Vn (log L Q 1 - log(6C + 2)) . 

■ 

Again note that the tightness of the bounds in Theorem [2] depends on the constant C. For example, 
if the silence probability is much smaller than Lj~ for all k, then C is small and the bounds are very 
tight. 

For odd-height trees, we need to calculate the reduction in the total error probability associated with 

a single step. For this, we have the following proposition. 

Proposition 6: If (ak,f3k,Qk) 6 U, then we have 

Lk+i . -i , Lk+i , -, 
—p^- > 1 and — — < 1. 

L k L k 

The proof is given in Appendix D. From Propositions [5] and [6] we give bounds for the total error 
probability at the fusion center for trees with odd height. 

Theorem 3: Suppose that (ao, Po, qo) € R and q^ is monotone non-increasing. If there exists C > 
such that qk < CL^ for k = 0, 1, ... , log N — 1, then for the case where log N is odd, 

(logL 1 - log(6C + 2)) < logiV < V2N (logL 1 + l) . 
The proof is similar to that of Theorem [2] and it is provided in Appendix E. 

Theorems [2] and [3J respectively, establish upper and lower bounds for logP^ 1 for trees with even and 
odd heights, for the case where hypotheses Hq and Hi are equally likely. For the case where the prior 
probabilities are not equal, i.e., P(Hq) ^ P(Hi), we can derive bounds for the total error probability in a 
similar fashion. Suppose that the fusion rule is as before, i.e., the likelihood-ratio test with unit-threshold. 
The total error probability at the fusion center is P/v = P(Pto)c i iogN + P{Hi)/3\ogN- Without loss of 
generality, we assume that P(Hq) < P(H\). We are interested in bounds for log-P^ 1 . 
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Theorem 4: Suppose that (ao, A), Qo) S R and is monotone non-increasing. If there exists C > 
such that g/j < CL^ for fc = 0, 1, ... , log N — 1, then for the case where log N is even, we have 



VJV(log Lq 1 - log(6C + 2)) + log P(tfi)- 1 < log P" 1 < ViV(log L 1 + 1) + log P(H )-\ 
For the case where log N is odd, we have 

^(log^ 1 -log(6C + 2)) + logP(F 1 )- 1 < logi^ 1 < ^(logLo 1 + 1) + log P(Pq)" 1 - 

Proof: First we consider the even-height tree case. Recall that P/v = £iogiV = aiogTV + Aog/y- We 

have 

P(#o)Pv < Pv = P(H )ai ogN + P(Pi)Ao g JV < P{Hi)P N . 

From the upper and lower bounds for logP^ 1 derived in Theorem |2j we can get the upper and lower 
bounds for logP^ : 

\ogP N l > logP(Pi)- 1 + logP^ 1 > logPCFx)- 1 + \/iV(logLo 1 " l°g(6C + 2)) 

and 

logP^ 1 < logP(Po)^ 1 +logP N 1 < logP(Po)" 1 + VNQogLo 1 + 1). 

For the odd-height tree case, we can mimic the proof using the bounds in Theorem 3. The details are 
omitted. 

■ 

These non-asymptotic results are useful. For example, given e G (0, 1), if we want to know how many 
sensors are required such that P/v < e, we can simply find the smallest N that satisfies the inequality in 
Theorem [2] i.e., 

y/N (log L 1 - log(6C + 2)) > log e" 1 . 

Hence we have 

loge -1 



N > 



>gL 1 -log(6C + 2) 
The growth rate for the number of sensors is B((log 1/e) 2 ). 

We now discuss the asymptotic decay rates. The system enters the invariant region R eventually if the 
height of the tree is sufficiently large. Therefore to consider the asymptotic decay rate, it suffices just to 
consider the decay rate when the system lies in R. In addition, the bounds in Theorems |2^[4] only differ 
by constant terms, and so it suffices to consider only the asymptotic decay rate for trees with even height 
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in the equal prior probability case. Moreover, when we consider the asymptotic regime, that is, N — > oo, 
the sufficient condition in Theorems |2]-|4j i.e., < CL^, can be written as = 0(L&). We have the 
following result. 

Corollary 2: Suppose that (ao, /?o, <7o) ^ R anc ^ Ik is monotone non-increasing. If g% = O(Lfe), then 
the asymptotic decay rate is logP^ 1 = Q(y/N). 

This implies that the decay of the total error probability is sub-exponential with exponent y/N . Thus, 
compared to the non-failure case, the scaling law of the asymptotic decay rate does not change when we 
have node and link failures in the tree, provided that the probabilities of silence q^ decay to sufficiently 
fast such that it is dominated by Lk in the asymptotic regime. 

C. Discussion on the Sufficient Condition 

We have shown that if qj. = 0{Lk), then the scaling law for the asymptotic decay rate remains the 
same as that of the non-failure case discussed in ll34l . Notice that the silence probability sequence {q^} 
depends on the local failure probabilities {pk}, which we regard as an exogenous input. Next we consider 
how the decay rate of pk determines the decay rate of q^. Recall that the recursion of is 

Qk+i =qt + {l-ql)p k +i- 

Since is non-increasing, the first term q\ decays at least quadratically fast to and (1 — qf.) /* 1 in 
the second term. Therefore, if p^ decays more slowly than quadratically, then the value of q^ linearly 
depends on 

Proposition 7: Suppose that the local failure probability sequence {pk} is non-increasing. Then, the 
decay rate of the total error probability remains \/JV, i.e., logP^ 1 = @(y/N), if and only if the decay 
rate of pk is not smaller than 2 fc / 2 , i.e., logp^T 1 = Q,(2 k l 2 ). 

Proof: By Corollary [IJ we have logi 3 ^ 1 = 0{y/~N). This together with monotonicity of P/v imply 
that logP^ 1 is either @(y/N) or o(VN). 

First we show that if logp^ 1 = 0(2 fc / 2 ), then logP^ 1 = Q(y/N). From Corollary [j] we know that 
the decay rate of the total error probability is not better than \/N, that is, logP^ 1 = 0(y/N). We divide 
our proof into three cases based on the decay rate of pk- If logp^" 1 = 0(2 fc ), that is, if pk decays at 
least exponentially fast with respect to 2 k , then we can easily show that qk = O(Lk). If Pk decays more 
slowly than the above rate and logp^ 1 = uj(2 k l 2 ), then for sufficiently large k we have 

Qk+i =ql + ( 1 - Qk)Pk+l < 2Pk+i- 
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In consequence, q k decays faster than the sequence 2p k and therefore it decays faster than L k , that is., 
Ik = 0(L k ), in which case by Corollary 2, the decay rate of the total error probability at the fusion 
center remains \/N. In the case where logp^" 1 = 0(2 fc / 2 ), we prove the claim by contradiction. We 
assume that logP^ 1 = o(y/N). Therefore, we can write L k = P\ gN > 2~ c2k/2 for all c > 0. Moreover, 
there exists c\ such that q k < 2p k < 2~ Cl2k ' 2 . In this case the ratio Lfc +2 /L| is upper bounded: 



Lk+2 < , Qk Qk+i Qk gfc+i L - 

L\ ~ (l + q k )L 2 k (l+q k+1 )Ll (1 + q k ) (1 + q k+1 ) k 

2 -c l 2 k ' 2 + 2-Ci2< fc + 1 '/ 2 + 2 -Ci2 fc / 2 2-Ci2< fe + 1 )/ 2 

< H 



-2 



LI 



< 1 + 3- 



2 -e 1 2 fc / 2 



T 2 

k 



Because L k > 2~ c2k/2 for all c > 0, we have Lfc +2 /L| < 4. Using the same analysis as that of 
Theorem |2j we can show that logP^ 1 = Q(\/~N), which contradicts with the assumption. Hence, we 
conclude that if logp^ 1 = Q(2 fc / 2 ), then the decay rate of the total error probability remains y/~N, i.e., 
log P' 1 = G(y/N). 

Next we show that if logp^ 1 = o{2 k / 2 ), then logP^ 1 = o{y r N). This claim is also proved by 
contradiction. Suppose that the local failure probability does not decay sufficiently fast, more precisely, 
l°gPfc 1 = o(2 fc / 2 ) and the decay rate of the total error probability remains y^N. For sufficiently large k 
we have 

Qk+l =?& + (!- Qk)Pk+i > Pfe+l/2. 
Therefore we can write q k > 2~ c2,k/2 for all c > 0, in which case the ratio Lk+2/L 2 , is lower bounded: 

^ > , Qk w qk+1 M k + hr l 

2-c 1 2 fc / 2 2 -c 2 2<' t + 1 )/ 2 2 - 2fc/2 ( c i+v / 2 c 2) 

> A(a k + (3 k ) = 4(a fc + /3 fc ) ' (6) 
for all positive c\ and c% However, from the assumption that logP^ 1 = @(\/~N), we have L k < 2~ C32k/2 
for sufficiently large k, where C3 is a positive constant. In consequence, we have shown the ratio Q is 
not bounded above and L k+2 / L k — > 00. Therefore, the decay rate of the total error probability cannot 
remain yN and this rate is dominated by that of the non-failure case, i.e., logP^ 1 = o(yN). 



The above proposition tells us that the decay exponent of the total error probability remains y N if and 
only if the local failure probability decays to sufficiently fast. For illustration purposes, in Figs. (8ja) 
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and (b) we plot the total error probability Pjy versus the number N of sensors and log log versus 
log N, respectively. We set the prior probability P(Hq) = 0.4 and the local failure probability po = 0.1. 
As shown in Figs. [8ja) and (b), the solid (black) lines represent the total error probability curves in the 
non-failure case. The dashed (red) lines represent the total error probability curves in the failure case 
where the local failure probabilities decay quadratically, i.e., Pk+i = pi- This corresponds to a special 
case where qk < L\. for sufficiently large k, for which the decay rate remains y/N. The dotted (blue) 
lines represent the total error probability curves in the failure case where the local failure probabilities 
are identical, i.e., Pk+i = Pk- This corresponds to a case where > 0.05 for all k, for which the decay 
rate is strictly smaller than y/N. The plots are illustrative of the differences in decay rates as reflected 
by our analytical results. 

In the non-failure case and the quadratically decaying case described above, we have logP^ 1 = 
@(y/N), which means that there exist positive constants c\ and c 2 such that c\y/~N~ < \ogP^ 1 < C2VN . 
Therefore, we have 

log ci + - log N < log log Pj^ 1 < log c 2 + - log N. 

Notice that in Fig. [8jb) for sufficiently large log N (> 8), the slopes for the non-failure case and the 
quadratically decaying case are approximately 1/2, consistent with the bounds above. 

V. Concluding Remarks 

We have studied the detection performance of balanced binary relay trees with node and link failures. 
We have shown that the decay rate of the total error probability is 0(y/~N), which cannot be faster than 
that of the non-failure case. We have also derived upper and lower bounds for the total error probability 
at the fusion center as functions of N in the case where the silence probabilities decay to sufficiently 
fast. These bounds imply that the total error probability converges to sub-exponentially with exponent 
y/N. Compared to balanced binary relay trees with no failures, the step-wise shrinkage of the total error 
probability in the failure case is slower, but the scaling law of the asymptotic decay rate remains the 
same. By contrast, if the silence probabilities do not decay to sufficiently fast, then the decay rate in 
the failure case is strictly smaller than that in the non-failure case. 

Future work includes a number of topics. One of them is to understand the detection performance for 
other architectures with node and link failures, including Af-ary relay trees [36] and tandem networks 
|[39l -[431. For example, in general tree structures, if each node has degree more than 2, then the system 
is more robust to node and link failures than binary trees. Moreover, we expect that our techniques can 
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Fig. 8. (a) Total error probability Pn versus the number N of sensors, (b) Plot of log log P^ 1 versus logiV. Solid (black) lines 
represent the non-failure case. Dashed (red) lines represent the case where the local failure probabilities decay quadratically, 
i.e., pk+i = pi- Dotted (blue) lines represent the case where the local failure probabilities are identical, i.e., pk+i = Pk- 



be used to characterize the relationship between the decay rate of the local failure probability and the 
decay rate of the total error probability. 

Our assumption that the sensors make (conditionally) independent observations is restrictive and will 
often be violated. The correlated sensor scenario has been investigated in the parallel configuration fi4l - 
ll46l . The case of correlated sensor observations in tree networks is still open. In this paper, we have 
modeled the failure -prone links by binary erasure channels. Some other interesting models considered 
only in the parallel configuration include binary symmetric channels, Rayleigh fading channels |fl9H . and 
fading multiple-access channels [20], [21]. Other than the node and link failures considered in this paper, 
it would be of interest to characterize the impact of malicious byzantine nodes H71 . which intentionally 
report false information upward in the tree network. 

Appendix A 
Proof of Proposition Q] 

B and Ru share the same lower boundary j3 = a. Thus, it suffices to prove that the upper boundary 
of B lies below that of Ru for a fixed q, i.e., B q lies above R u in the (a,/3) plane. 
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The upper boundary of B is given by 

_ -q + yjq 2 + (1 - g) 2 (2a - a 2 ) + 2g(l - qfc 
1-q 

The upper boundary of Ry is given by 

(3 = -a + 2^ q2 + {1 - q2)a ^. 

1-q 

We need to prove the following: 

-q + V<1 2 + (1 " 9) 2 (2a - a 2 ) + 2q(l - q)a ^ Jq 2 + (1 - 9 2 )a - q 

< —a + z- 



1-g " 1-g 

The above inequality can be simplified as follows: 

^Jq 2 + (1 - <?) 2 (2a - a 2 ) + 2q(l - q)a < -a(l -q)-q + 2^J q 2 + (1 - q 2 )a. 

Squaring both sides and simplifying, we have 

2^q 2 + (l-q 2 )a(a(l - q) + q) < 2(q 2 + (1 - q 2 )a) - (1 - q) 2 (a - a 2 ). 

Again squaring both sides and simplifying, we have 

A{q 2 + {l-q 2 ) a ){(l-q)a + q) 2 < 

4(q 2 + (1 - q 2 )a) 2 + (1 - q)\a - a 2 ) 2 - 4(q 2 + (1 - q 2 )a)(l - q) 2 (a - a 2 ), 
which can be simplified as follows: 

4{q 2 + (1 - q 2 )a){q 2 + 2q{\ - q)a + (1 - qfa 2 - q 2 - (1 - q 2 )a + (1 - q) 2 (a - a 2 )) 
< {1 - q)\a - a 2 ) 2 . 

Fortuitously, the left-hand side turns out to be identically 0. Thus, the inequality holds. 

Appendix B 
Proof of TheoremQ] 

From the assumptions that q^ is monotone non-increasing and (ao> A), <Zo) £ ^> we snan see that the 
dynamic system stays inside R. First we show the following inequality: 

f >- \ 

The evolution of the system is 

(aik,f3k,qk) — > {Oik+l, Pk+l, qk+l) (a-k+2, Pk+2, qk+2) ■ 
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From Proposition 4J we have L k °J_ 2 < L k+ 2, where = 2a k+ \ — ci\ +1 + as defined before. To 
prove Lfc +2 /L| > 1/2, it suffices to show that L^7 2 /L^ > 1/2. We divide our proof into two cases: 
(a k , f3 k , Qk) G R U \B and (a k , f) k , q k ) G B. 
Case I. If (a k , /3 k ,q k ) £ R U \B, then 

L l+2 _ 2a k+l ~ Ofc+l + 



From the recursion Q, we have 



1 — q k , o\ 2q k 
Q-k+i = ^— (2afc - a k ) + — a fc > a k 

i- + qk 1 + qk 



and 



Thus, it suffices to show that 



a _ 1 Qk R 2 2q k „ ^ o2 

Pk+1 - r— Pk + 7~i Pk > Pk- 

l + q k 1 + qk 



2a k -a 2 k + Pi > 1 



(a k + f3 k ) 2 -2" 

It is easy to see that 2(2afc — a\) > 1 — (1 — a k ) . Hence, it suffices to show that 

(l-(l-a fe ) 4 + ^)>(a fc + /3 fc ) 2 , 



which has been proved in [34]. 

Case II. If (a k ,f3 k ,q k ) G B, then it suffices to show that 

„2 1 00 #2 



Again from Q, we have 



atn + 2fo +1 -/% +1 1 
(a fc + /3 fc ) 2 -2' 



1 — q k 2 2^ 

afc+i = ^— (2«fc - a k ) + — a k > a k 

1 + Qk 1 + Qk 



and 



o _ 1 ^fc #2 , 2 Qk o R 2 

Pk+1 ~ T——Pk + -1 1 ~ Pk > P k - 
L + Qk 1 + Qk 

Thus, it suffices to prove that 

(« fc + A) 2 - 2' 

which is obvious. This proves ([7]). We now prove the claim of Theorem[l] From ([7]), we have L k+2 = a k L 
for k = 0, 1, . . . , log iV — 2 and some a k > 1/2. Therefore, for k = 2, 4, ... , log iV, we have 

2 2 (fc ~ 2)/2 r 2 fc/2 

-^fc — 0(fc_ 2 )/2 • a (fc-4)/2 " " " a L ) 
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where Oj > 1/2, i = 0, 1, . . . , (k — 2)/2. Taking logs and using k = log iV, we have 

logP/ = - loga (fc _ 2) /2 - 21oga (fe _ 4)/2 2^ 2 ^ 2 log a + ViVlogL \ 

Notice that logZ-Q 1 > and loga^ > —1 for all i. Thus, 

logP/ < ^logLo 1 + = VN(logL 1 + 1) . 

Appendix C 
Proof of Proposition [5] 

The lower bound of L k+ 2 / B\ has been proved in the proof of Theorem [I] Here we derive the upper 

bound for L k+ 2/L\. Again we divide our proof into two cases: (a k ,/3 k , q k ) £ Ru\B and (a k , (3 k , q k ) € B. 

Case I. If (a k , f3 k ,q k ) G R u \ B, then 

L k+2 < _ 1 - q fc 2a fc - a 2 . + /3 2 | 2g^ 

L 2 - Lf, ~ l + q k (a k + (3 k ) 2 (1 + q k )(a k + fi k )' 

Since < CL k , the second term on the right-hand side of ([8]) is upper bounded as 

2qfc 

(1 + q k )(a k + P k ] 

We now show the other term is bounded above, namely, 



< 2C. 



2ak - a * + ^<4C + 2. (9) 



(a* + A) 2 
tfc _ v 4C + l)/? 2 

-2(4C + l)/3 fc - 2(4C + 2)a fc < 0. 



Let <f>(a k , fa) := 2a k - (4C + 3)a 2 _ (4C + 1 )#? - 2(4C + 2)a k f3 k < 0. We have 



Thus, the maximum of (j) is on the line a k + (3 k = q k /C and the upper boundary of B. If a k +/3 k = q k /C, 
then we have 

2a k -al + Pl _ 2{% - fo) + | (2fo - g) 
(a* + A) 2 {q k /Cy 
The partial derivative of the above term with respect to fi k is non-positive. Therefore, the maximum lies 
on the intersection of a k + f3 k = q k jC and the upper boundary of B. Hence, it suffices to show ((9]) on 
the upper boundary of B, which is given by 

_ -q + yV + (1 ~ q) 2 {2a - a 2 ) + 2q{\ - q)a 
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Let (f(a,q) := y/q 2 + (1 - q) 2 (2a - a 2 ) + 2q(l - q)a. We have 

<P( a k, Pk) = - (ql + Ok + 0- _ qk) 2 {2a k - a\) + 2q k (l - q k )a k - 2q k (p(a k , q k ))/ (1 - q k f 
+ 2a k - {AC + 3)a 2 k - ACf k - 2(AC + 2)a k f3 k 

{AC + 2)a 2 k - AC Pi - 2{AC + 2)a k p k . 



2q k (3 k 2q k a k 2 2 



1 - q k 1 - q k 

Since q k < C(afc + (3 k ) and is non-positive. This proves ([9]). Moreover, we have (1 — q k ) /(l + 

< 1, which combined with (|9]), gives 

1 - q k 2a k -al + Pl <4C + 2 
l + Qk (a k + A) 2 ~~ 

Thus, we have L k+2 /L 2 k < 6C + 2. 

Case II. We now show that L k+2 /L 2 < 6C + 2 for the case where (a k ,f3 k ,q k ) £ 5, From Proposition 
l^jwe have 4+2 ^ L k+2 , where 4+2 denotes the total error probability if we use q k to calculate L k+2 
from L k+ i. Therefore, it suffices to prove that 

4+2 - ( 6C + 2 ) L l = «fc+2 + " (6C + 2)(a fc + /3 fc ) 2 < 0. 

We have 

a _ 1 ~ Qk q2 

Pfc+i - — p fe + — pV 

i + qk l + % 

Since % < CL k , we have /3& > q k /(2C) and 

^ = ^^A + T ^<(6C + 2)/ 3fc . 
a/3 fe 1 + q k 1 + % 

From recursion Q, we have 

J- + qk 1 + qk t + qk ± + qk 

Therefore, 

d/h+2 _ 1 ~ gfc q dftfc+i 2 g/3 fc+ i 

Consequently, 



a(4^-(6C + 2)L 2 



< 2(6C + 2)/3 fc - 2(6C + 2)a fc - 2(6C + 2)f3 k < 0. 



We can consider the line a k +(3 k = q k /C and the lower boundary of B, which is given by f3 k = a k . With a 
similar argument, the maximum can be shown to lies on the intersection of a k + f3 k = q k /C and the lower 
boundary of B. Moreover, we know that if (3 k = a k , then L k+ i = L k and (a k +i, P k +i) lies on the lower 
boundary of Rc- Following a similar argument to Case I, we arrive at L k+2 /L 2 = L k+2 / L k+l < 6C + 2. 
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Appendix D 
Proof of Proposition [6] 



To prove L k+ i/Lt > 1, it suffices to show that 



— L% = — ^ (2a fc - a 2 + $ - (a fc + /3 fc ) 2 ) + — ^- (a fc + & - (a fc + (3 k ) 2 ) > 0. (10) 

i + qk i + % 



Note that the second term on the right-hand side of ( fT0[ ) is non-negative. The first term can be written as 

2a k -al + ffi- (a k + faf = 2a k {\ - (a k + fa)) > 0, 

which is also positive. 

To prove L k+ i/L k < 1, it suffices to show that 

L k +i ~ L k = ] : 9fc (2«fc - af. + /3| - (a k + (3 k )) + 2<lk (a k + /3 k - (a k + f3 k )) < 0, 

which is easy to see because the first term is non-positive. 

Appendix E 
Proof of Theorem[3] 

From Proposition [6j we have L\ = HLq for some a > 1. And, by Proposition |5j the following identity 
holds: L k+2 = a k L\ for k = 1, 3, ... , log N — 2 and some G [1/2, 6C + 2]. Hence, we can write 

L k — a " a (fe-l)/2 • a (fe_3)/2 " " " a l ^0 > 

where aj G [1/2, 6C + 2] for each i = 1, 3, . . . , (k — l)/2, and a > 1. Taking logs and using k = log AT, 
we have 

log P^ 1 = -2( fe " 1 )/ 2 logS - log a (fc _ 1)/2 2( fe - 3 V 2 log a x + ^log L Q \ 

Notice that logLp 1 > and logaj > — 1 for all i. Moreover, logo > 0. Hence, 

logP^ 1 < ^(bgL^ + l). 

We now establish the lower bound. It follows from Proposition [6] that L k = aL k _i for some a G (0, 1]. 
By Proposition [5] we have L k+2 = a k L\ for k = 0, 2, ... , log — 3 and some a k G [1/2, 6C + 2]. Thus, 

L k — a • a(fc_3)/2 • 0(fc_3)/2 ' ' ' °o ^0 > 

where a* G [1/2, 6C + 2] for each i = 0, 2, . . . , (k - 3)/2, and a G (0, 1]. Hence, 

log P N 1 = -loga- log o (J fe_i )/2 2( fc " 3 )/ 2 log 01 + ^log L \ 

Notice that logLp 1 > and —1 < logaj < log(6C + 2) for all i, and logo < 0. Thus, 

logP^ 1 > -log(6C + 2)y^+ y^logLo 1 = ^(logLo 1 - log(6C + 2)) . 
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